Comparative Benchmarking of Causal Discovery Techniques
نویسندگان
چکیده
In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference. For (b) and (c) we train causal Bayesian networks with structures as predicted by each causal discovery technique to carry out counterfactual or standard predictive inference. We compare causal algorithms on two publicly available and one simulated datasets having different sample sizes: small, medium and large. Experiments show that structural accuracy of a technique does not necessarily correlate with higher accuracy of inferencing tasks. Further, surveyed structure learning algorithms do not perform well in terms of structural accuracy in case of datasets having large number of variables. 1 ar X iv :1 70 8. 06 24 6v 2 [ cs .A I] 1 2 Se p 20 17
منابع مشابه
A novel feature selection techniques based on contrast set mining
Data classification is a challenging task in era of big data due to high number of features. Feature selection is a step in process of knowledge discovery in data that aims to reduce dimensionality and improve the classification performance. The purpose of this research is to define new techniques for feature selection in order to improve classification accuracy and reduce the time required for...
متن کاملBenchmarking Link Discovery Systems for Geo-Spatial Data
Linking geo-spatial entities is targeted only by a limited number of link discovery benchmarks. Linking spatial resources requires techniques that differ from the classical, mostly string-based approaches. In particular, considering the topology of the spatial resources and the topological relations between them is of central importance to systems that manage spatial data. Due to the large amou...
متن کاملA Study of Causal Discovery With Weak Links and Small Samples
Weak causal relationships and small sample size pose two significant difficulties to the automatic discovery of causal models from observational data. This paper examines the influence of weak causal links and varying sample sizes on the discovery of causal models. The experimental results i l lustrate the effect of larger sample sizes for discovering causal models reliably and the relevance of...
متن کاملBenchmarking Sustainability with Respect to Transportation Supply and Demand
This paper is an endeavor to quantify the concept of sustainable transportation. The prevailing idea in the context of sustainable development (SD) emphasizes on the reduction of transportation demand in order to reduce the environmental and social consequences of it. Nevertheless, in the current paper using a measure for SD, and based on the conformity of the growths of all sectors with transp...
متن کاملTowards a Framework for Dependability Benchmarking
The goal of dependability benchmarking is to provide generic ways for characterizing the behavior of components and computer systems in the presence of faults, allowing for the quantification of dependability measures. Beyond existing evaluation techniques, dependability benchmarking must provide a reproducible and cost-effective way of performing this evaluation either as stand alone assessmen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1708.06246 شماره
صفحات -
تاریخ انتشار 2017